A Cost-Aware and Workload-Based Index Advisor for Columnar In-Memory Databases
نویسندگان
چکیده
Optimal index configurations for in-memory databases differ significantly from configurations for their traditional disk-based counterparts. Operations such as full column scans that have previously been prohibitively expensive in disk-based and row-oriented databases are now computationally feasible with columnar main memory-resident data structures and even outperform index-based accesses in many cases. Furthermore, index selection criteria are different for in-memory databases since maintenance costs are often lower while memory footprint considerations have become increasingly important. In this paper, we introduce a workload-based and cost-aware index advisor tailored for columnar in-memory databases in mixed workload environments. We apply a memory traffic-driven model to estimate the efficiency of each index and to give a system-wide overview of the indices that are cost-ineffective with respect to their size and performance improvement. We also present our Index Advisor Cockpit applied to a real-world live production enterprise system of a Global 2000 company.
منابع مشابه
A Storage Advisor for Hybrid-Store Databases
With the SAP HANA database, SAP offers a high-performance in-memory hybrid-store database. Hybrid-store databases—that is, databases supporting rowand column-oriented data management—are getting more and more prominent. While the columnar management offers high-performance capabilities for analyzing large quantities of data, the row-oriented store can handle transactional point queries as well ...
متن کاملAn Adaptive Aggregate Maintenance Approach for Mixed Workloads in Columnar In-Memory Databases
The mixed database workloads generated by enterprise applications can be categorized into shortrunning transactional as well as long-running analytical queries with resource-intensive data aggregations. The introduction of materialized views can accelerate the execution of aggregate queries significantly. However, the overhead of materialized view maintenance has to be taken into account and va...
متن کاملDevelopment and Validation of a Pilot Activity Load Index (PALI) based on NASA-TLX template
Abstract Introduction: Workload can be defined as the hypothetical construct that represents the cost incurred by a human operator to achieve a particular level of performance. Each job has specific needs and demands. The better measurement tool assessing that estimate the workload, it’s need to identify the requirements of a task, the circumstances under which it is performed, and the skills,...
متن کاملWiSeDB: A Learning-based Workload Management Advisor for Cloud Databases
Workload management for cloud databases deals with the tasks of resource provisioning, query placement, and query scheduling in a manner that meets the application’s performance goals while minimizing the cost of using cloud resources. Existing solutions have approached these three challenges in isolation while aiming to optimize a single performance metric. In this paper, we introduce WiSeDB, ...
متن کاملUsing Object-Awareness to Optimize Join Processing in the SAP HANA Aggregate Cache
The introduction of columnar in-memory databases, along with hardware evolution, has made the execution of transactional and analytical workloads on a single system both feasible and viable. Yet, doing analytics directly on the transactional data introduces an increasing amount of resourceintensive aggregate queries which can slow down the overall system performance in a multi-user environment....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016